Abstract

We revisit the traveling salesman problem with neighborhoods (TSPN) and present the first
constant-ratio approximation for disks in the plane: Given a set of $n$ disks in the plane, a TSP
tour whose length is at most $O(1)$ times the optimal can be computed in time that is polynomial
in $n$. Our result is the first constant-ratio approximation for a class of planar convex bodies of
arbitrary size and arbitrary intersections.

In order to achieve an $O(1)$-approximation, we reduce the traveling salesman problem with
disks, up to constant factors, to a minimum weight hitting set problem in a geometric
hypergraph. The connection between TSPN and hitting sets in geometric hypergraphs, established
here, is likely to have future applications.

Keywords: Traveling salesman problem, minimum weight hitting set, approximation algorithm.


1 Introduction 

In the Euclidean Traveling Salesman Problem (ETSP), given a set of points in the Euclidean space
$\mathbb{R}^d$, $d \ge 2$, one seeks a shortest closed curve (a.k.a. tour) that visits each point. In the TSP with
neighborhoods (TSPN), each point is replaced by a point set, called a region or neighborhood, and the
TSP tour must visit at least one point in each region, i.e., it must intersect each region. The oldest
record that we could trace of this variant goes back to Arkin and Hassin [2]. Since the Euclidean
TSP is known to be NP-hard in $\mathbb{R}^d$ for every $d \ge 2$ [?, 22, ?], TSPN is also NP-hard for every
$d \ge 2$. TSP is recognized as one of the cornerstone problems in combinatorial optimization. Other
related problems in geometric network optimization can be found in the two (somewhat outdated)
surveys by Mitchell [32, 33].

It is known that the Euclidean TSP admits a polynomial-time approximation scheme in $\mathbb{R}^d$,
where $d = O(1)$, due to classic results of Arora [3] and Mitchell [31]. Subsequent running time
improvements have been obtained by Rao and Smith [?]; specifically, the running time of their PTAS
is $O(f(\varepsilon)\, n \log n)$, where $f(\varepsilon)$ grows exponentially in $1/\varepsilon$. In contrast, TSPN is generally harder
to approximate. Typically, somewhat better approximations are available when the neighborhoods
are pairwise disjoint, or fat, or have comparable sizes. We briefly review some of the previous work
concerning approximation algorithms for TSPN.
Related work. Arkin and Hassin [2] gave constant-factor approximations for translates of a
connected region, and more generally, for neighborhoods of pairwise parallel diameters, where the
ratio between the longest and the shortest diameter is bounded by a constant. Dumitrescu and
Mitchell [16] extended the above result to connected neighborhoods with comparable diameters.
Bodlaender et al. [8] described a PTAS for TSPN with disjoint fat neighborhoods of about the
same size in $\mathbb{R}^d$, where $d$ is constant (this includes the case of disjoint unit disks in the plane).
Earlier, Dumitrescu and Mitchell [16] proposed a PTAS for TSPN with fat neighborhoods of about
the same size and bounded depth in the plane, where Spirkl [44] recently reported and filled a gap;
see also a follow-up note in [36].

Mata and Mitchell [29] gave an $O(\log n)$-approximation for TSPN with $n$ connected and arbitrarily
intersecting neighborhoods in the plane; see also [7]. Elbassioni et al. [19] and Gudmundsson and
Levcopoulos [23] improved the running time of the algorithm. The $O(\log n)$-approximation relies
on the following early result by Levcopoulos and Lingas [28]: Every (simple) rectilinear polygon $P$
with $n$ vertices, $r$ of which are reflex, can be partitioned in $O(n \log n)$ time into rectangles whose
total perimeter is $\log r$ times the perimeter of $P$.

Using an approximation algorithm due to Slavik [33] for Euclidean group TSP, de Berg et al. [6]
obtained constant-factor approximations for disjoint fat convex regions (of arbitrary diameters) in
the plane. Subsequently, Elbassioni et al. [19] gave constant-factor approximations for arbitrarily
intersecting fat convex regions of comparable size. Preliminary work by Mitchell [?] gave a PTAS
for planar regions of bounded depth and arbitrary size, in particular for disjoint fat regions. Chan
and Jiang [12] gave a PTAS for fat, weakly disjoint regions in metric spaces of constant doubling
dimension (combining an earlier QPTAS by Chan and Elbassioni [11] with a PTAS for TSP in
doubling metrics by Bartal et al. [?]).

Disks and balls are undoubtedly among the simplest neighborhood types [?]. TSPN for
disks is NP-hard, and it remains so for congruent disks, since when the disk centers are fixed and
the radius tends to zero, the problem reduces to TSP for points. Regarding approximations, the
case of congruent balls is relatively well understood: Given a set of $n$ congruent (say, unit) balls
in $\mathbb{R}^d$, a TSP tour whose length is at most $O(1)$ times the optimal can be computed in polynomial
time, when $d$ is constant [18]. However, for disks of arbitrary radii and intersections, no
constant-ratio approximation was known. Some of the difficulties with disks of arbitrary radii in the plane
were uncovered in [?].

Recent work of Dumitrescu and Tóth [18] focused on unbounded neighborhoods, such as lines
or hyperplanes: They gave a constant-factor approximation for TSPN with $n$ hyperplanes in $\mathbb{R}^d$
in $O(n)$ time; and an $O(\log^3 n)$-approximation for $n$ lines in $\mathbb{R}^d$ in time polynomial in $n$, where $d$ is
constant. In contrast, the current paper considers TSPN with arbitrary disks in $\mathbb{R}^2$, which requires
quite different approximation techniques and new ideas.

Degree of approximation. Regarding the degree of approximation achievable, TSPN with arbitrary
neighborhoods is generally APX-hard [?, 32], and it remains so even for segments of nearly
the same length [19]. For disconnected neighborhoods, TSPN cannot be approximated within any
constant ratio unless P = NP [?]. Further, approximating TSPN for (arbitrary) connected
neighborhoods in the plane within a factor smaller than 2 is NP-hard [42]. Delineating the class of
neighborhoods for which constant-factor approximations are possible remains mysterious, at least
at the moment. It is conjectured that approximating TSPN for disconnected regions in the plane
within an $O(\log^{1/2} n)$ factor is intractable unless P = NP [32]. Similarly, it is conjectured that
approximating TSPN for connected regions in $\mathbb{R}^3$ within an $O(\log^{1/2} n)$ factor and for disconnected
regions in $\mathbb{R}^3$ within an $O(\log^{2/3} n)$ factor [32] are probably intractable.
Our results. In this paper we present a polynomial-time (deterministic) algorithm that, given
a set of $n$ disks in $\mathbb{R}^2$ (with arbitrary radii and intersections), returns a TSP tour whose length is
$O(1)$ times the optimal.

Theorem 1. Given a set of $n$ disks in the plane, a TSP tour whose length is at most $O(1)$ times
the optimal can be computed in time polynomial in $n$.

In their seminal paper on TSPN, Arkin and Hassin [2] suggested disks as the most natural type
of neighborhood, in which the traveling salesman can meet each potential buyer at some point
close to the respective buyer's location. Here the radius of each disk indicates how much each
potential buyer is willing to travel to meet the salesman. While constant-ratio approximations for
disks of the same (or comparable) radius [2, 16] and for disjoint disks of arbitrary radii [6] have been
obtained early on, the case of disks with arbitrary radii and arbitrary intersections has remained
open until now.

2 Preliminaries 

We achieve an $O(1)$-approximation for TSP with disks by reducing the problem, up to constant
factors, to a minimum weight hitting set problem in a geometric hypergraph, for which a
constant-factor approximation algorithm was found only recently [13].

Hitting sets. Hitting sets are defined in general in terms of hypergraphs (i.e., set systems or
range spaces). A hypergraph is a pair $\mathcal{G} = (V, E)$ where $V$ is a finite vertex set and $E \subseteq 2^V$ is a
finite collection of subsets of $V$ (called edges). In a geometric (primal) hypergraph, the vertex set
$V$ is a finite set of $n$ points in Euclidean space and all sets in $E$ are of the form $V \cap Q$ where
$Q$ is a certain geometric shape of bounded description complexity, e.g., halfspace, ball, triangle,
axis-aligned rectangle, etc. Geometric hypergraphs often have nice properties such as bounded
VC-dimension or bounded union complexity; see [20, 39].

A hitting set in a hypergraph $\mathcal{G} = (V, E)$ is a subset of vertices $H \subseteq V$ such that every hyperedge
in $E$ contains some point in $H$. The minimum hitting set (MHS) problem asks for a hitting set of
minimum cardinality in a given hypergraph. The minimum weight hitting set (MWHS) problem
asks for a hitting set of minimum weight in a given hypergraph with vertex weights $w : V \to \mathbb{R}^+$.
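
To make these definitions concrete, the following minimal Python sketch checks whether a vertex subset is a hitting set and builds one with a simple heuristic. All names are hypothetical. The greedy rule shown (pick the vertex with the smallest weight per newly hit hyperedge) is the classical set-cover style heuristic and is used here purely for illustration; it is not one of the geometric approximation algorithms discussed in this paper.

```python
from typing import Dict, Hashable, Iterable, List, Set


def is_hitting_set(hitters: Set[Hashable], edges: Iterable[Set[Hashable]]) -> bool:
    """True if every hyperedge contains at least one chosen vertex."""
    return all(hitters & edge for edge in edges)


def greedy_weighted_hitting_set(weights: Dict[Hashable, float],
                                edges: List[Set[Hashable]]) -> Set[Hashable]:
    """Illustrative heuristic (assumption: positive weights, nonempty edges).

    Repeatedly picks the vertex minimizing weight / (number of unhit edges it hits).
    """
    unhit = [set(e) for e in edges]
    chosen: Set[Hashable] = set()
    while unhit:
        def cost(v: Hashable) -> float:
            hits = sum(1 for e in unhit if v in e)
            return weights[v] / hits if hits else float("inf")

        best = min(weights, key=cost)
        if cost(best) == float("inf"):
            raise ValueError("some hyperedge cannot be hit by any vertex")
        chosen.add(best)
        unhit = [e for e in unhit if best not in e]
    return chosen
```

The weight of the returned set is `sum(weights[v] for v in chosen)`; no approximation guarantee beyond the classical logarithmic one should be read into this sketch.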

Brönnimann and Goodrich [9] gave an $O(\log \mathrm{OPT})$-approximation for MHS in geometric
hypergraphs using LP-relaxations and the fact that geometric hypergraphs have bounded VC-dimension.
Clarkson and Varadarajan [14] gave an $O(\log\log n)$-approximation for some geometric hypergraphs,
by observing a connection between hitting sets and the combinatorial complexity of the union of
the corresponding geometric objects. Mustafa and Ray [?] gave a PTAS for MHS with disks and
pseudo-disks in the plane using a local search paradigm; see also [?, 10]. However, this method
does not seem to extend to the weighted version (MWHS).

Varadarajan [45] gave an $O(\log\log n)$-approximation for MWHS, extending the results from [14].
His approach was further extended by Chan et al. [13] who obtained a randomized polynomial-time
$O(1)$-approximation algorithm for MWHS in geometric hypergraphs of linear union complexity,
including geometric hypergraphs defined by disks in $\mathbb{R}^2$ [26]; their algorithm can be derandomized
[13, Section 3]. Specifically, we use the following result due to Chan et al. [13].

Theorem 2. ([13, Corollary 1.4 and Section 3]) There is a polynomial-time (deterministic)
$O(1)$-approximation algorithm for the minimum weight hitting set problem for disks in $\mathbb{R}^2$.
Definitions. Let $\mathcal{R}$ be a set of regions (neighborhoods) in $\mathbb{R}^2$. An optimal TSP tour for $\mathcal{R}$,
denoted by $\mathrm{OPT}(\mathcal{R})$, is a shortest closed curve in the plane that intersects every region in $\mathcal{R}$; when
$\mathcal{R}$ is clear from the context, $\mathrm{OPT}(\mathcal{R})$ and OPT are used interchangeably.

The Euclidean length of a curve $\gamma$ is denoted by $\mathrm{len}(\gamma)$. Similarly, the total (Euclidean) length
of the edges of a geometric graph $G$ is denoted by $\mathrm{len}(G)$. The perimeter of a polygon $P$ is denoted
by $\mathrm{per}(P)$; the boundary and the interior of a region $R$ are denoted by $\partial R$ and $R^\circ$, respectively;
the convex hull of a planar set $S$ is denoted by $\mathrm{conv}(S)$.

The distance between two planar point sets $S_1, S_2 \subseteq \mathbb{R}^2$ is $\mathrm{dist}(S_1, S_2) = \inf\{\mathrm{dist}(s_1, s_2) :
s_1 \in S_1, s_2 \in S_2\}$. The distance between a point set $S_1$ and a geometric graph $G$ is defined as
$\mathrm{dist}(S_1, G) := \mathrm{dist}(S_1, S_G)$, where $S_G$ is the set of all points at vertices and on the edges of $G$.

Algorithm Outline. Given a set $S$ of $n$ disks in the Euclidean plane, we construct a connected
geometric graph $G$ that intersects every disk in $S$ and such that $\mathrm{len}(G) = O(\mathrm{len}(\mathrm{OPT}))$. An
Eulerian tour of the multi-graph obtained by doubling each edge of $G$ visits each disk and its
length is $2\,\mathrm{len}(G) = O(\mathrm{len}(\mathrm{OPT}))$, as desired.
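
The final step of the outline, turning a connected graph into a closed tour by doubling its edges, can be sketched as follows. This is a minimal illustration under the assumption that $G$ is given as a list of edges between coordinate tuples; `doubled_edge_tour` and `tour_length` are hypothetical names, and the recursive traversal is only suitable for small inputs.

```python
import math
from collections import defaultdict


def doubled_edge_tour(edges, start):
    """Closed walk in a connected graph that traverses every edge exactly
    twice (once in each direction), i.e., an Eulerian tour of the
    multi-graph obtained by doubling each edge."""
    adj = defaultdict(list)
    for idx, (u, v) in enumerate(edges):
        adj[u].append((idx, v))
        adj[v].append((idx, u))
    used, visited, tour = set(), {start}, [start]

    def walk(u):
        for idx, v in adj[u]:
            if idx in used:
                continue
            used.add(idx)
            tour.append(v)      # go along the edge
            if v not in visited:
                visited.add(v)
                walk(v)
            tour.append(u)      # come back along the same edge

    walk(start)
    return tour


def tour_length(tour):
    """Total Euclidean length of a walk given as a list of points."""
    return sum(math.dist(p, q) for p, q in zip(tour, tour[1:]))
```

Because every edge is traversed in full, the walk meets every disk that the graph meets, and its length is exactly twice the total edge length.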

The graph $G$ is the union of three geometric graphs, $G_1$, $G_2$ and $G_3$. The graph $G_1$ is an
$O(1)$-approximation of an optimal tour for a maximal subset of pairwise disjoint disks in $S$; this
step is based on earlier results [6, 19] (Section 3). The graph $G_2$ connects $G_1$ to nearby disks
that are guaranteed to be at distance at most $\mathrm{len}(\mathrm{OPT})/n$ from $G_1$. The graph $G_3$ connects any
remaining disks to $G_1$; this step is based on recent results on minimum weight hitting sets due to
Chan et al. [13] (Sections 5 and 6).

The interface between TSP and the hitting set problem is established by a quadtree
subdivision [5, Ch. 14]. Previously, Arora [3] and Mitchell [35] used quadtrees for approximating Euclidean
TSP and TSP with disjoint neighborhoods, respectively. The quadtree variety that we need, a
so-called stratified grid, was introduced by Mitchell [35] for certain orthogonal polygons. Here we
define stratified grids in a more general setting, for arbitrary geometric graphs (Section 4).

3 Preprocessing 

Let $S$ be a set of $n$ disks in the plane. The algorithm first constructs the graphs $G_1$ and $G_2$ as
follows (Fig. 1).

1. Select an independent subset $I$, $I \subseteq S$, of pairwise disjoint disks by the following greedy
algorithm: Set $I := \emptyset$. Consider the disks in $S$ in increasing order of radius (with ties broken
arbitrarily), and successively place a disk $D \in S$ into $I$ if it is disjoint from all previous disks
in $I$.

2. Compute a constant-factor approximate TSP tour $\xi_0$ for $I$ using the algorithm in [6] or [19].
(A PTAS for disjoint disks in the plane is available [16, 44] but not needed here.) It is clear
that $\mathrm{len}(\xi_0) = O(\mathrm{len}(\mathrm{OPT}))$.

3. Let $R$ be a minimum axis-parallel square such that $\mathrm{conv}(R)$ intersects every disk in $S$ (i.e.,
every disk intersects $\partial R$ or is contained in $R^\circ$). The square $R$ is determined by up to 3 disks
in $S$, thus $R$ can be trivially computed in $O(n^4)$ time: there are $O(n^3)$ squares that pairs
and triples define, and each can be checked in $O(n)$ time as to whether it intersects all disks.
Alternatively, finding $R$ is an LP-type problem of combinatorial dimension 3 that can be
solved in $O(n)$ time [30, Section 5]; see also [23] for a modern treatment of LP-type problems
and violator spaces. Let $r$ denote the side-length of $R$; obviously, we have $r \le \mathrm{len}(\mathrm{OPT})$.

4. Let $G_1$ be the union of $\xi_0$, $R$, and a shortest line segment connecting $\xi_0$ and $R$ (if disjoint).
The graph $G_1$ intersects all disks in $I$ and possibly some disks in $S \setminus I$. Our primary interest is
in the disks in $S$ that are disjoint from $G_1$.
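
Step 1 of the preprocessing can be sketched in a few lines of Python. This is a minimal illustration, assuming each disk is a hypothetical `(x, y, radius)` triple and that two closed disks are disjoint when their center distance exceeds the sum of their radii; the quadratic running time is chosen for brevity, not efficiency.

```python
import math
from typing import List, Tuple

Disk = Tuple[float, float, float]  # (center x, center y, radius)


def disks_disjoint(d1: Disk, d2: Disk) -> bool:
    """Closed disks are disjoint iff the centers are farther apart than r1 + r2."""
    return math.hypot(d1[0] - d2[0], d1[1] - d2[1]) > d1[2] + d2[2]


def greedy_independent_set(disks: List[Disk]) -> List[Disk]:
    """Scan disks by increasing radius; keep a disk if it is disjoint
    from every disk kept so far."""
    chosen: List[Disk] = []
    for d in sorted(disks, key=lambda disk: disk[2]):
        if all(disks_disjoint(d, c) for c in chosen):
            chosen.append(d)
    return chosen
```

By construction, every rejected disk intersects a kept disk of equal or smaller radius.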



Figure 1: Left: a set of disks in the plane; the independent set $I$ selected by a greedy algorithm is highlighted;
a TSP tour $\xi_0$ for $I$ and a minimum square $R$ intersecting all disks are shown. Middle: $G_1$ is the union of
$\xi_0$, $R$, and a shortest line segment connecting them; only the disks in $S_2 \cup S_3$ that do not intersect $G_1$ are
shown. Right: a stratified grid for $R$ and $G_1$.


Lemma 1. For every disk $D \in S$, we have $\mathrm{dist}(D, G_1) \le \mathrm{diam}(D)$.

Proof. Let $D \in S$. If $G_1$ intersects $D$, then $\mathrm{dist}(D, G_1) = 0$, and the claim is trivial. Assume that
$D$ is disjoint from $G_1$. Since $\xi_0$ intersects every disk in $I$ and $\xi_0 \subseteq G_1$, we have $D \in S \setminus I$. By the
greedy choice of $I \subseteq S$, the disk $D$ intersects some disk $D' \in I$ of equal or smaller radius, where
$G_1$ intersects $D'$. Consequently, $\mathrm{dist}(D, G_1) \le \mathrm{diam}(D') \le \mathrm{diam}(D)$. □

Connecting nearby disks to $G_1$. We partition $S$ into three subsets: let $S_1$ be the set of disks
in $S$ that intersect $G_1$; let $S_2$ be the set of disks $D \in S \setminus S_1$ such that $\mathrm{dist}(D, G_1) \le \frac{r}{n}$; and let
$S_3 = S \setminus (S_1 \cup S_2)$. Let $G_2$ be a graph that consists of $|S_2|$ line segments: specifically for every $D \in S_2$,
$G_2$ contains a shortest segment connecting $D$ and $G_1$. Then $\mathrm{len}(G_2) = \sum_{D \in S_2} \mathrm{dist}(D, G_1) \le
|S_2| \cdot \frac{r}{n} \le r \le \mathrm{len}(\mathrm{OPT})$. By construction, we have

$$\mathrm{dist}(D, G_1) > \frac{r}{n} \quad \text{for every } D \in S_3. \tag{1}$$

By Lemma 1 and inequality (1) we have

Corollary 1. For every disk $D \in S_3$, we have $\mathrm{diam}(D) > \frac{r}{n}$.

In the next section, we show how to find a geometric graph $G_3$ such that $G_3$ intersects every
disk in $S_3$ and $G_1 \cup G_3$ is connected (note, however, that $G_3$ need not be connected).
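
The partition of $S$ into $S_1$, $S_2$, $S_3$ can be sketched as follows. This is a minimal illustration under stated assumptions: $G_1$ is represented by a hypothetical list of line segments `g1_edges`, disks are `(x, y, radius)` triples, and the distance from a closed disk to the graph is the center-to-graph distance minus the radius (clamped at zero).

```python
import math


def point_segment_dist(p, seg):
    """Euclidean distance from point p to the segment seg = (a, b)."""
    (ax, ay), (bx, by) = seg
    dx, dy = bx - ax, by - ay
    denom = dx * dx + dy * dy
    t = 0.0 if denom == 0 else max(0.0, min(1.0, ((p[0] - ax) * dx + (p[1] - ay) * dy) / denom))
    return math.hypot(p[0] - (ax + t * dx), p[1] - (ay + t * dy))


def disk_graph_dist(disk, g1_edges):
    """dist(D, G1) for a closed disk D and a graph given by its edges."""
    x, y, rad = disk
    return max(0.0, min(point_segment_dist((x, y), e) for e in g1_edges) - rad)


def partition_disks(disks, g1_edges, r):
    """Split disks into S1 (meeting G1), S2 (within r/n of G1), S3 (the rest)."""
    n = len(disks)
    s1, s2, s3 = [], [], []
    for d in disks:
        dist = disk_graph_dist(d, g1_edges)
        if dist == 0.0:
            s1.append(d)
        elif dist <= r / n:
            s2.append(d)
        else:
            s3.append(d)
    return s1, s2, s3
```

The total length of the connecting segments for `s2` is then at most `len(s2) * r / n`, which is at most `r`.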

4 Stratified Grids 

Recall that we have a geometric graph $G_1$, and a set $S_3$ of at most $n$ disks in the interior of an
axis-aligned square $R$ of side-length $r$, $r \le \mathrm{len}(\mathrm{OPT})$, satisfying (1). Let $\mathrm{OPT}(S_3, G_1)$ denote a
geometric graph $\Gamma$ of minimum length such that $G_1 \cup \Gamma$ is connected and intersects every disk in
$S_3$. Note that $\mathrm{len}(\mathrm{OPT}(S_3, G_1)) \le \mathrm{len}(\mathrm{OPT}(S_3)) \le \mathrm{len}(\mathrm{OPT})$, for every $G_1$.

In Sections 5 and 6 we use hitting sets to compute an $O(1)$-approximation of $\mathrm{OPT}(S_3, G_1)$.
Similarly to a quadtree decomposition, we recursively construct a subdivision of $R$ into squares of
side-lengths $r/2^i$, for $i = 0, 1, \ldots, \lceil \log n \rceil$. Refer to Fig. 1 (right).
Previously, Mitchell [35] used a similar quadtree decomposition for TSPN with disjoint regions
in the plane, coined the term "stratified grid," and derived several basic properties of quadtrees
that we rederive here. Specifically, he proved analogues of two of the lemmas below for the problem studied
in [35]. However, Mitchell used stratified grids only for special types of orthogonal polygons, called
histograms [?]; here we generalize this tool to arbitrary geometric graphs.

The following algorithm subdivides a square $Q$ unless it is too small (i.e., $\mathrm{diam}(Q) \le \frac{r}{2n}$) or it
is relatively far from $G_1$ (i.e., $\mathrm{diam}(Q) \le \mathrm{dist}(Q, G_1)$).

Stratify($R$, $G_1$). Let $L$ be a FIFO queue and $\mathcal{Q}$ be a set of axis-aligned squares. Set
$L = (R)$ and $\mathcal{Q} = \emptyset$. Repeat the following while $L$ is nonempty. Set $Q \leftarrow \mathrm{dequeue}(L)$.

If $\mathrm{diam}(Q) > \max(\frac{r}{2n}, \mathrm{dist}(Q, G_1))$, then subdivide $Q$ into four congruent axis-aligned
squares, and enqueue them onto $L$. Otherwise, let $\mathcal{Q} \leftarrow \mathcal{Q} \cup \{Q\}$. Return $\mathcal{Q}$.

It is worth noting that $\mathcal{Q}$ does not directly depend on the disks in $S_3$, but only indirectly, via $G_1$.
By construction, the squares in $\mathcal{Q}$ are interior-disjoint, and every square in $\mathcal{Q}$ has diameter at least
$r/(4n)$. Consequently, the number of squares in $\mathcal{Q}$ is $O(n^2)$. Thus the algorithm Stratify($R$, $G_1$)
runs in polynomial time in $n$, since $O(n^2)$ squares are enqueued onto $L$, and $\mathrm{dist}(Q, G_1)$ can be
computed in polynomial time for all $Q \in L$. We show that the squares in $\mathcal{Q}$ have a property similar
to the disks in $S_3$ (cf. Lemma 1): only larger squares can be farther from $G_1$.
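
The subdivision procedure can be sketched in Python as follows. This is a minimal illustration under stated assumptions: $G_1$ is a hypothetical list of segments `(a, b)`, a square is a triple `(x, y, side)` with lower-left corner `(x, y)`, and the square-to-segment distance is computed with a Liang-Barsky clipping test plus vertex distances (a choice made here for the sketch, not taken from the paper).

```python
import math
from collections import deque


def point_segment_dist(p, a, b):
    """Euclidean distance from point p to the segment ab."""
    dx, dy = b[0] - a[0], b[1] - a[1]
    denom = dx * dx + dy * dy
    t = 0.0 if denom == 0 else max(0.0, min(1.0, ((p[0] - a[0]) * dx + (p[1] - a[1]) * dy) / denom))
    return math.hypot(p[0] - (a[0] + t * dx), p[1] - (a[1] + t * dy))


def segment_hits_box(a, b, box):
    """Liang-Barsky clipping test: does segment ab meet the closed box?"""
    x0, y0, x1, y1 = box
    dx, dy = b[0] - a[0], b[1] - a[1]
    t0, t1 = 0.0, 1.0
    for p, q in ((-dx, a[0] - x0), (dx, x1 - a[0]), (-dy, a[1] - y0), (dy, y1 - a[1])):
        if p == 0:
            if q < 0:
                return False        # parallel to this side and outside of it
        elif p < 0:
            t0 = max(t0, q / p)     # entering parameter
        else:
            t1 = min(t1, q / p)     # leaving parameter
    return t0 <= t1


def segment_box_dist(a, b, box):
    """Distance between segment ab and a closed axis-aligned box."""
    if segment_hits_box(a, b, box):
        return 0.0
    x0, y0, x1, y1 = box

    def point_box(p):
        return math.hypot(max(x0 - p[0], 0.0, p[0] - x1), max(y0 - p[1], 0.0, p[1] - y1))

    corners = [(x0, y0), (x1, y0), (x0, y1), (x1, y1)]
    # For disjoint convex sets the minimum is attained at a vertex of one of them.
    return min([point_segment_dist(c, a, b) for c in corners] + [point_box(a), point_box(b)])


def stratify(R, g1_edges, n):
    """Return the leaf squares of the stratified grid for the root square R."""
    r = R[2]
    queue, leaves = deque([R]), []
    while queue:
        x, y, s = queue.popleft()
        diam = s * math.sqrt(2)
        d = min(segment_box_dist(a, b, (x, y, x + s, y + s)) for a, b in g1_edges)
        if diam > max(r / (2 * n), d):
            h = s / 2
            queue.extend([(x, y, h), (x + h, y, h), (x, y + h, h), (x + h, y + h, h)])
        else:
            leaves.append((x, y, s))
    return leaves
```

For instance, with root square `(0, 0, 8)`, `n = 2`, and a single segment along the bottom side, the squares near the segment have side 1 and those farther away have side 2.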

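Lemma 2. For every square $Q \in \mathcal{Q}$, we have $\mathrm{dist}(Q, G_1) < 3\,\mathrm{diam}(Q)$.

Proof. Put $q = \mathrm{diam}(Q)$. Recall that $Q$ is obtained by subdividing a square $Q'$, $Q \subset Q'$, with
$\mathrm{diam}(Q') = 2q$. Since $Q'$ is subdivided by the algorithm, we have $\mathrm{diam}(Q') > \max(\frac{r}{2n}, \mathrm{dist}(Q', G_1))$,
and in particular $\mathrm{dist}(Q', G_1) < 2q$. Since $\mathrm{dist}(p', Q) \le q$ for every point $p' \in Q'$, the triangle
inequality yields $\mathrm{dist}(Q, G_1) \le \mathrm{dist}(Q', G_1) + q < 3q$. □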




Figure 2: Left: a square $Q$ of the stratified grid, and the graph $\gamma(Q)$. Right: a polygonal curve $\alpha$; the
intersections of $\alpha$ with horizontal (resp., vertical) edges of the stratified grid are marked with empty (resp.,
full) dots.

For every square $Q \in \mathcal{Q}$, we define a graph $\gamma(Q)$ that consists of the boundary of $Q$ and a shortest
line segment from $Q$ to $G_1$; see Fig. 2 (left). By Lemma 2, we have $\mathrm{len}(\gamma(Q)) \le (3 + 2\sqrt{2})\,\mathrm{diam}(Q)$;
on the other hand, $\mathrm{len}(\gamma(Q)) \ge \mathrm{per}(Q) = 2\sqrt{2}\,\mathrm{diam}(Q)$, and so we have the following.

Corollary 2. For every $Q \in \mathcal{Q}$, we have $\mathrm{len}(\gamma(Q)) = \Theta(\mathrm{diam}(Q))$.

The following observation is crucial for reducing the problem of approximating $\mathrm{OPT}(S_3, G_1)$ to
a minimum weight hitting set problem.
Lemma 3. If a square $Q \in \mathcal{Q}$ intersects a disk $D \in S_3$, then

(i) $\mathrm{diam}(Q) \le 2\,\mathrm{diam}(D)$, and

(ii) $D$ intersects the boundary of $Q$ (and the graph $\gamma(Q)$ in particular).

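Proof. (i) Since $Q \in \mathcal{Q}$, Algorithm Stratify($R$, $G_1$) did not subdivide $Q$, and so we have
$\mathrm{diam}(Q) \le \frac{r}{2n}$ or $\mathrm{diam}(Q) \le \mathrm{dist}(Q, G_1)$. If $\mathrm{diam}(Q) \le \frac{r}{2n}$, then Corollary 1 yields

$$\mathrm{diam}(Q) \le \frac{r}{2n} < \frac{r}{n} < \mathrm{diam}(D) < 2\,\mathrm{diam}(D).$$

If $\mathrm{diam}(Q) \le \mathrm{dist}(Q, G_1)$, then $\mathrm{dist}(Q, G_1) \le \mathrm{dist}(D, G_1) + \mathrm{diam}(D)$ follows from the
intersection condition and the triangle inequality. Consequently,

$$\mathrm{diam}(Q) \le \mathrm{dist}(Q, G_1) \le \mathrm{dist}(D, G_1) + \mathrm{diam}(D) \le 2\,\mathrm{diam}(D),$$

where the last inequality holds by Lemma 1.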

(ii) Suppose, to the contrary, that the boundary of $Q$ is disjoint from $D$, hence $D$ lies in the
interior of $Q$. This immediately implies

$$\mathrm{dist}(Q, G_1) \le \mathrm{dist}(D, G_1). \tag{2}$$

Since $Q \in \mathcal{Q}$, Algorithm Stratify($R$, $G_1$) did not subdivide $Q$, and so we have $\mathrm{diam}(Q) \le \frac{r}{2n}$
or $\mathrm{diam}(Q) \le \mathrm{dist}(Q, G_1)$. If $\mathrm{diam}(Q) \le \frac{r}{2n}$, Corollary 1 yields
$\mathrm{diam}(Q) \le \frac{r}{2n} < \frac{r}{n} < \mathrm{diam}(D)$. If
$\mathrm{diam}(Q) \le \mathrm{dist}(Q, G_1)$, then the combination of (2) and Lemma 1 yields

$$\mathrm{diam}(Q) \le \mathrm{dist}(Q, G_1) \le \mathrm{dist}(D, G_1) \le \mathrm{diam}(D).$$

In both cases, we have shown that $\mathrm{diam}(Q) \le \mathrm{diam}(D)$. Therefore $D$ cannot lie in the interior
of $Q$, which contradicts the assumption. □


Recall that the squares in $\mathcal{Q}$ can only intersect at common boundary points; we call such squares
adjacent.

Lemma 4. If two squares $Q_1, Q_2 \in \mathcal{Q}$ are adjacent and $\mathrm{diam}(Q_1) \le \mathrm{diam}(Q_2)$, then

$$\tfrac{1}{2}\,\mathrm{diam}(Q_2) \le \mathrm{diam}(Q_1) \le \mathrm{diam}(Q_2) \le 2\,\mathrm{diam}(Q_1).$$

Proof. If $\mathrm{diam}(Q_1) = \mathrm{diam}(Q_2)$, the inequalities are satisfied. We may thus assume that
$\mathrm{diam}(Q_1) < \mathrm{diam}(Q_2) = q$. Then Algorithm Stratify($R$, $G_1$) subdivided a square $Q_1'$ such that
$Q_1 \subset Q_1'$, $\mathrm{diam}(Q_1') \le q$, and $Q_1' \cap Q_2 \ne \emptyset$. The algorithm subdivided $Q_1'$ but did not
subdivide $Q_2$. This implies

$$q > \max\left(\frac{r}{2n}, \mathrm{dist}(Q_1', G_1)\right) \quad \text{and} \quad q \le \max\left(\frac{r}{2n}, \mathrm{dist}(Q_2, G_1)\right).$$

The first inequality yields $q > \max(\frac{r}{2n}, \mathrm{dist}(Q_1', G_1)) \ge \frac{r}{2n}$, and then the second inequality yields
$q \le \max(\frac{r}{2n}, \mathrm{dist}(Q_2, G_1)) = \mathrm{dist}(Q_2, G_1)$. Consequently, $\mathrm{dist}(Q_1', G_1) < q \le \mathrm{dist}(Q_2, G_1)$.

Since $Q_1'$ and $Q_2$ intersect, and their diameters are at most $q$, the triangle inequality yields
$|\mathrm{dist}(Q_1', G_1) - \mathrm{dist}(Q_2, G_1)| \le q$. It follows that

$$q \le \mathrm{dist}(Q_2, G_1) \le \mathrm{dist}(Q_1', G_1) + q < 2q.$$

Similarly, since $Q_1$ and $Q_2$ intersect, $\mathrm{dist}(Q_1, G_1) \ge \mathrm{dist}(Q_2, G_1) - \mathrm{diam}(Q_1) \ge q - \mathrm{diam}(Q_1)$.
Combining with Lemma 2, we get

$$3\,\mathrm{diam}(Q_1) > \mathrm{dist}(Q_1, G_1) \ge q - \mathrm{diam}(Q_1),$$

that is, $\mathrm{diam}(Q_1) > q/4$. Finally, recall that the ratio between the diameters of any two squares in
$\mathcal{Q}$ is a power of 2. Therefore $q/4 < \mathrm{diam}(Q_1) < q$ yields $\mathrm{diam}(Q_1) = q/2$, as required. □
5 Hitting Sets for Squares and Disks


For the graph $G_1$ and the set of disks $S_3$, we define a hypergraph $\mathcal{G} = (\mathcal{Q}, E)$, where the vertex
set is the set $\mathcal{Q}$ of squares in the stratified grid; and for every disk $D \in S_3$, the set of squares in $\mathcal{Q}$
that intersect $D$ forms a hyperedge in $E$. Thus, a subset $\mathcal{H} \subseteq \mathcal{Q}$ of squares is a hitting set in the
hypergraph $\mathcal{G}$ if and only if every disk in $S_3$ intersects some square in $\mathcal{H}$.
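
A minimal Python sketch of this hypergraph follows, assuming squares are hypothetical `(x, y, side)` triples and disks are `(x, y, radius)` triples. A closed disk meets a closed square exactly when the distance from its center to the square is at most its radius; each vertex (square) is given its diameter as weight.

```python
import math


def disk_meets_square(disk, square):
    """True if the closed disk and the closed axis-aligned square intersect."""
    cx, cy, rad = disk
    x, y, s = square
    dx = max(x - cx, 0.0, cx - (x + s))
    dy = max(y - cy, 0.0, cy - (y + s))
    return math.hypot(dx, dy) <= rad


def build_square_hypergraph(squares, disks):
    """Return (weights, edges): weights[i] is the diameter of squares[i];
    edges[k] is the set of indices of squares that meet disks[k]."""
    weights = [s * math.sqrt(2) for (_, _, s) in squares]
    edges = [{i for i, q in enumerate(squares) if disk_meets_square(d, q)} for d in disks]
    return weights, edges
```

A set of square indices is then a hitting set exactly when it has a nonempty intersection with every entry of `edges`.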

For every hitting set $\mathcal{H}$, the geometric graph $\Gamma = \bigcup_{Q \in \mathcal{H}} \gamma(Q)$ intersects every disk in $S_3$ by
Lemma 3, and $G_1 \cup \Gamma$ is connected by construction. Let the weight of a square $Q \in \mathcal{Q}$ be
$w(Q) = \mathrm{diam}(Q)$. In this section, we show that the minimum-weight hitting set for $\mathcal{G} = (\mathcal{Q}, E)$ is an
$O(1)$-approximation for $\mathrm{OPT}(S_3, G_1)$. The following technical lemma considers a single curve (i.e., a
Jordan arc). For a curve $\alpha$, let $\mathcal{Q}(\alpha)$ denote the set of squares in $\mathcal{Q}$ that intersect $\alpha$. Refer to
Fig. 2 (right).

Lemma 5. Let $\alpha$ be a directed polygonal curve whose start and end points lie on $G_1$. If $\alpha$ intersects
at least one disk in $S_3$, then $\mathrm{len}(\alpha) = \Omega\left(\sum_{Q \in \mathcal{Q}(\alpha)} \mathrm{diam}(Q)\right)$.

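Proof. By (1), we have $\mathrm{dist}(D, G_1) > \frac{r}{n}$ for every disk $D \in S_3$. Since $\alpha$ intersects at least one disk
in $S_3$, we have $\mathrm{len}(\alpha) \ge \frac{2r}{n}$.

Let $A = (Q_0, Q_1, \ldots, Q_m)$ be the sequence of distinct squares that intersect $\alpha$ in the order in
which they are first encountered by $\alpha$ (with no repetitions and ties broken arbitrarily). Since $Q_0$
intersects $G_1$, we have $\mathrm{diam}(Q_0) \le \max(\frac{r}{2n}, 0) = \frac{r}{2n}$, and consequently

$$\mathrm{len}(\alpha) \ge \frac{2r}{n} \ge 4\,\mathrm{diam}(Q_0). \tag{3}$$

Let $B = (Q_{\sigma(0)}, \ldots, Q_{\sigma(\ell)})$ be the subsequence of $A$ such that $\sigma(0) = 0$ and a square $Q_i$,
$1 \le i \le m$, is added to $B$ if it is disjoint from all squares previously added to $B$. By construction, $B$ consists
of pairwise disjoint squares, and every square in $A$ is either in $B$ or adjacent to some square in $B$.
By Lemma 4, the sizes of adjacent squares in $\mathcal{Q}$ differ by a factor of at most 2. Consequently, each
square in $\mathcal{Q}$ is adjacent to at most 12 squares in $\mathcal{Q}$ (at most two along each side and at most one
at each corner). It follows that

$$\sum_{i=0}^{m} \mathrm{diam}(Q_i) = O\left(\sum_{j=0}^{\ell} \mathrm{diam}(Q_{\sigma(j)})\right). \tag{4}$$

For $j = 0, \ldots, \ell$, let $p_{\sigma(j)}$ be the first intersection point of $\alpha$ with $Q_{\sigma(j)}$. For two points $p, q \in \alpha$,
denote by $\alpha(p, q)$ the portion of $\alpha$ between $p$ and $q$. Since the squares in $B$ are pairwise disjoint,
and the sizes of adjacent squares differ by at most a factor of 2 (Lemma 4), any two disjoint squares are
separated by a distance proportional to the size of the larger one, and we have

$$\mathrm{len}(\alpha(p_{\sigma(j)}, p_{\sigma(j+1)})) \ge |p_{\sigma(j)} p_{\sigma(j+1)}| = \Omega\left(\max\left(\mathrm{diam}(Q_{\sigma(j)}), \mathrm{diam}(Q_{\sigma(j+1)})\right)\right)$$

for $j = 0, \ldots, \ell - 1$. Consequently, if $\ell \ge 1$, we have

$$\mathrm{len}(\alpha) \ge \sum_{j=0}^{\ell-1} \mathrm{len}(\alpha(p_{\sigma(j)}, p_{\sigma(j+1)})) = \Omega\left(\sum_{j=0}^{\ell} \mathrm{diam}(Q_{\sigma(j)})\right). \tag{5}$$

The combination of (3), (4), and (5) yields $\mathrm{len}(\alpha) = \Omega\left(\sum_{i=0}^{m} \mathrm{diam}(Q_i)\right)$, as required. □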



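Lemma 6. If $\Gamma$ is a geometric graph such that $\Gamma$ intersects every disk in $S_3$ and $G_1 \cup \Gamma$ is connected,
then there is a hitting set $\mathcal{H} \subseteq \mathcal{Q}$ for $\mathcal{G} = (\mathcal{Q}, E)$ such that

$$\mathrm{len}(\Gamma) = \Omega\left(\sum_{Q \in \mathcal{H}} \mathrm{diam}(Q)\right). \tag{6}$$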

Proof. Let $\mathcal{H}$ be the set of squares in $\mathcal{Q}$ that intersect $\Gamma$, and observe that $\mathcal{H}$ is a hitting set for
$\mathcal{G}$. For each connected component $C$ of $\Gamma$, let $\alpha$ be a directed polygonal curve that starts and
ends at some points in $G_1 \cap C$ and traverses every edge of $C$ at least once and at most twice.
Then $\mathrm{len}(C) \ge \frac{1}{2}\,\mathrm{len}(\alpha)$, and $\mathrm{len}(\alpha) = \Omega\left(\sum_{Q \in \mathcal{Q}(\alpha)} \mathrm{diam}(Q)\right)$ by Lemma 5. Summation over all the
components of $\Gamma$ yields (6). □

Recall that $\mathrm{OPT}(S_3, G_1)$ is a geometric graph $\Gamma$ of minimum length such that $G_1 \cup \Gamma$ is connected
and intersects every disk in $S_3$. Let $W_0$ denote the minimum weight of a hitting set in the hypergraph
$\mathcal{G}$. The main result of this section is the following.

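Corollary 3. We have $W_0 = O(\mathrm{len}(\mathrm{OPT}(S_3, G_1)))$.

Proof. Invoke Lemma 6 with $\Gamma = \mathrm{OPT}(S_3, G_1)$. Then $\mathcal{G}$ has a hitting set $\mathcal{H} \subseteq \mathcal{Q}$ of weight
$\sum_{Q \in \mathcal{H}} \mathrm{diam}(Q) = O(\mathrm{len}(\Gamma)) = O(\mathrm{len}(\mathrm{OPT}(S_3, G_1)))$. This is clearly an upper bound on the
minimum weight $W_0$ of a hitting set in $\mathcal{G}$. □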


6 Hitting Sets for Points and Disks 


In Section 5 we defined a hypergraph $\mathcal{G} = (\mathcal{Q}, E)$ for squares and disks; that is, the vertices are
squares in $\mathcal{Q}$ and the hyperedges are the squares intersecting a disk in $S_3$. In order to apply
Theorem 2 by Chan et al. [13], we reduce the problem to a traditional geometric hypergraph
problem, where the vertices are points in $\mathbb{R}^2$ and a hyperedge corresponds to the set of points
contained in a disk $D \in S_3$.

For each square $Q \in \mathcal{Q}$, we define a set of 25 sentinel points, and show (Lemma 7) that if a
disk $D \in S_3$ intersects $Q$, then $D$ contains one of the sentinel points of $Q$. A constant number of
sentinels suffice if none of the disks intersecting $Q$ is too small, and indeed, Lemma 3 has shown
that this is the case.

For a square $Q \in \mathcal{Q}$, where $Q = [a, a+h] \times [b, b+h]$, let the 25 sentinel points be $(a + ih/2, b + jh/2)$
for all $i, j \in \{-1, 0, 1, 2, 3\}$; see Fig. 3 (left).
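
The sentinel construction is easy to state in code. The following minimal Python sketch (hypothetical names; squares are `(a, b, h)` triples and disks are `(x, y, radius)` triples) generates the 25 sentinels of a square and tests whether a closed disk contains one of them.

```python
import math
from itertools import product


def sentinel_points(a, b, h):
    """The 25 sentinels (a + i*h/2, b + j*h/2), i, j in {-1, 0, 1, 2, 3},
    of the square [a, a+h] x [b, b+h]."""
    return [(a + i * h / 2, b + j * h / 2) for i, j in product(range(-1, 4), repeat=2)]


def disk_contains_sentinel(disk, square):
    """True if the closed disk contains some sentinel of the square."""
    cx, cy, rad = disk
    a, b, h = square
    return any(math.hypot(x - cx, y - cy) <= rad for x, y in sentinel_points(a, b, h))
```

As a quick numeric example, for the square $[0, 2]^2$ the disk with center $(2.6, 1.3)$ and radius $0.71$ touches the square and contains the sentinel $(3, 1)$.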
Figure 3: Left: the set of 25 sentinel points for a square $Q$. Right: a disk $D \in S_3$ intersects a square $Q \in \mathcal{Q}$.
Lemma 7. If a disk $D \in S_3$ intersects a square $Q \in \mathcal{Q}$, then $D$ contains a sentinel point
corresponding to $Q$.

Proof. Assume that a disk $D \in S_3$ intersects a square $Q \in \mathcal{Q}$ of side length $h$; refer to Fig. 3 (right).
By Lemma 3, $\mathrm{diam}(Q) \le 2\,\mathrm{diam}(D)$. By scaling down $D$ from an arbitrary center in $D \cap Q$, we
find a disk $D'$ intersecting $Q$ with $\mathrm{diam}(D') = \frac{1}{2}\,\mathrm{diam}(Q)$. The inscribed axis-aligned square $Q'$ of
$D'$ has $\mathrm{diam}(Q') = \frac{1}{2}\,\mathrm{diam}(Q)$. That is, the side-length of $Q'$ is $h/2$, and $\mathrm{dist}(Q', Q) \le (\sqrt{2} - 1)h$.
Since the sentinels of $Q$ form a section of a square lattice of (the same) side-length $h/2$, within
distance $\frac{1}{2}h$ from $Q$, some sentinel of $Q$ lies in $Q'$, and hence in $D' \subseteq D$, as claimed. □

We define a new weighted hypergraph $\mathcal{G}' = (V', E')$, where $V'$ is the union of sentinel point sets
for all $Q \in \mathcal{Q}$ that lie in $R$ (sentinels in the exterior of $R$ are discarded); and each hyperedge in $E'$ is
the set of sentinels in $V'$ contained in a disk $D \in S_3$. Note that a sentinel $s \in V'$ may correspond to
several squares in $\mathcal{Q}$. Let the weight of a sentinel $s \in V'$ be the sum of the diameters of the squares
$Q \in \mathcal{Q}$ that correspond to $s$. Hence the total weight of all sentinels is at most $25 \sum_{Q \in \mathcal{Q}} \mathrm{diam}(Q)$.
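
The sentinel weights can be accumulated with a dictionary keyed by point coordinates. The sketch below is a minimal illustration with hypothetical names; it assumes that squares are `(a, b, h)` triples with dyadic coordinates, so that coinciding sentinels of different squares compare equal as floating-point tuples, and that `R` is given as `(x, y, r)`.

```python
import math
from collections import defaultdict
from itertools import product


def sentinel_weights(squares, R):
    """Map each sentinel inside the closed square R to the sum of the
    diameters of the grid squares it corresponds to."""
    rx, ry, r = R
    weight = defaultdict(float)
    for a, b, h in squares:
        for i, j in product(range(-1, 4), repeat=2):
            s = (a + i * h / 2, b + j * h / 2)
            if rx <= s[0] <= rx + r and ry <= s[1] <= ry + r:
                weight[s] += h * math.sqrt(2)   # diam of the square
    return dict(weight)
```

For four unit squares tiling $[0, 2]^2$, the common corner $(1, 1)$ is a sentinel of all four squares and receives weight $4\sqrt{2}$.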

We next derive a bound on the weight of each sentinel. 

Lemma 8. For every $Q \in \mathcal{Q}$, the weight of every sentinel corresponding to $Q$ is $O(\mathrm{diam}(Q))$.

Proof. By Lemma 4, the side-lengths of adjacent squares of the stratified grid differ by at most a
factor of 2. Consequently, every sentinel in $V'$ corresponding to a square $Q \in \mathcal{Q}$ is contained in $Q$
or in a square of $\mathcal{Q}$ adjacent to $Q$.

Let $s \in V'$ be a sentinel. Then $s$ may correspond to all squares in $\mathcal{Q}$ that contain $s$, and to
adjacent squares in $\mathcal{Q}$. Every point is contained in at most 4 squares of $\mathcal{Q}$, whose side-lengths
differ by a factor of at most 2; and they are each adjacent to $O(1)$ additional squares whose
side-lengths differ by another factor of at most 2. Overall, $s$ corresponds to $O(1)$ squares in $\mathcal{Q}$ whose
side-lengths differ by a factor of $O(1)$. Therefore, the weight of $s$ is $O(\mathrm{diam}(Q))$ for every square
$Q \in \mathcal{Q}$ corresponding to $s$. □

By Theorem 2, there is a polynomial-time $O(1)$-approximation algorithm for MWHS on $\mathcal{G}' =
(V', E')$. It remains to show that an $O(1)$-approximation for MWHS on the hypergraph $\mathcal{G}' = (V', E')$
provides an $O(1)$-approximation for MWHS on the hypergraph $\mathcal{G} = (\mathcal{Q}, E)$.

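Lemma 9. 1. For every hitting set $\mathcal{H} \subseteq \mathcal{Q}$ for $\mathcal{G}$, the set $H'$ of sentinels in $V'$ corresponding to
the squares $Q \in \mathcal{H}$ is a hitting set for $\mathcal{G}'$ of weight $O\left(\sum_{Q \in \mathcal{H}} \mathrm{diam}(Q)\right)$.

2. For every hitting set $H' \subseteq V'$ for $\mathcal{G}'$, the set $\mathcal{H}$ of squares $Q \in \mathcal{Q}$ that contain the sentinel
points in $H'$ is a hitting set for $\mathcal{G}$ of weight at most $\sum_{s \in H'} w(s)$.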

Proof. (1) If $\mathcal{H}$ is a hitting set for $\mathcal{G}$, then every disk $D \in S_3$ intersects some square $Q \in \mathcal{H}$. By
Lemma 7, $D$ contains one of the sentinels of $Q$. Consequently, every disk $D \in S_3$ contains a sentinel
in $H'$. Every square $Q \in \mathcal{H}$ corresponds to 25 sentinels, each of weight $O(\mathrm{diam}(Q))$ by Lemma 8.
The weight of $H'$ is $O\left(\sum_{Q \in \mathcal{H}} \mathrm{diam}(Q)\right)$.

(2) If $H'$ is a hitting set for $\mathcal{G}'$, then every disk $D \in S_3$ contains some point $s \in H'$. The point
$s$ lies in a square $Q \in \mathcal{Q}$ of the stratified grid, which is in $\mathcal{H}$. Consequently, every disk $D \in S_3$
intersects some square $Q \in \mathcal{H}$. By construction, the weight of each sentinel $s$ is the sum of weights
of the corresponding squares in $\mathcal{Q}$, including all squares in $\mathcal{Q}$ that contain $s$. Therefore, the weight
of $\mathcal{H}$ is at most $\sum_{s \in H'} w(s)$, as required. □
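
The second direction of Lemma 9, mapping a hitting set of sentinels back to grid squares, amounts to a point-in-square test. The following minimal Python sketch uses hypothetical names and assumes squares are `(a, b, h)` triples and points are coordinate pairs.

```python
def squares_containing(points, squares):
    """Indices of all closed squares that contain at least one of the points."""
    return {
        i
        for i, (a, b, h) in enumerate(squares)
        if any(a <= x <= a + h and b <= y <= b + h for (x, y) in points)
    }
```

A sentinel on a shared boundary selects every square that contains it, which matches the accounting in the proof: each selected square contributes its diameter to the weight of that sentinel.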

We are now ready to prove Theorem 1 by analyzing the constructed graph $G = G_1 \cup G_2 \cup G_3$.
Proof of Theorem 1. Let $S$ be a set of $n$ disks in $\mathbb{R}^2$. Compute an independent set $I \subseteq S$ as
described in Section 3, and a TSP tour $\xi_0$ for $I$ with $\mathrm{len}(\xi_0) = O(\mathrm{len}(\mathrm{OPT}))$ (as in [6] or [19]).
Compute the graph $G_1$ with $\mathrm{len}(G_1) = O(\mathrm{len}(\mathrm{OPT}))$, and the partition $S = S_1 \cup S_2 \cup S_3$ as described
in Section 3. The graph $G_1$ intersects the disks in $S_1$.

Construct the graph $G_2$, which contains a shortest segment between $G_1$ and every disk $D \in S_2$.
The length of this graph is $\mathrm{len}(G_2) = \sum_{D \in S_2} \mathrm{dist}(D, G_1) \le |S_2| \cdot \frac{r}{n} \le r \le \mathrm{len}(\mathrm{OPT})$.

Compute the stratified grid $\mathcal{Q}$, and construct the weighted hypergraph $\mathcal{G}' = (V', E')$, where $V'$
is the set of sentinel points for all squares $Q \in \mathcal{Q}$, the weight of a sentinel $s$ is the sum of diameters
of the corresponding squares $Q \in \mathcal{Q}$, and for every disk $D \in S_3$, the set of sentinels lying in $D$
forms a hyperedge in $E'$. Use the algorithm by Chan et al. [13] to compute a hitting set $H'$ for
$\mathcal{G}'$ whose weight is $O(1)$ times the minimum. Let $\mathcal{H}$ be the set of squares in $\mathcal{Q}$ containing the
sentinels in $H'$. By Lemma 9, $\mathcal{H}$ is a hitting set for the hypergraph $\mathcal{G} = (\mathcal{Q}, E)$ whose weight is at
most $O(1)$ times the minimum $W_0$. Put $G_3 = \bigcup_{Q \in \mathcal{H}} \gamma(Q)$. Then $G_3$ intersects every disk in $S_3$ by
Lemma 3, and $\mathrm{len}(G_3) = O\left(\sum_{Q \in \mathcal{H}} \mathrm{diam}(Q)\right) = O(W_0)$ by Corollary 2. Finally, Corollary 3 yields
$\mathrm{len}(G_3) = O(\mathrm{len}(\mathrm{OPT}(S_3, G_1)))$. By the definition of $\mathrm{OPT}(S_3, G_1)$, we have $\mathrm{len}(\mathrm{OPT}(S_3, G_1)) \le
\mathrm{len}(\mathrm{OPT}(S_3)) \le \mathrm{len}(\mathrm{OPT})$, and consequently $\mathrm{len}(G_3) = O(\mathrm{len}(\mathrm{OPT}))$.

Note that the graph $G = G_1 \cup G_2 \cup G_3$ is connected by construction, it intersects every disk
in $S = S_1 \cup S_2 \cup S_3$, and $\mathrm{len}(G) \le \mathrm{len}(G_1) + \mathrm{len}(G_2) + \mathrm{len}(G_3) = O(\mathrm{len}(\mathrm{OPT}))$. Consequently, an
Eulerian tour of the multi-graph containing each edge of $G$ twice visits each disk and its length is
$2\,\mathrm{len}(G) = O(\mathrm{len}(\mathrm{OPT}))$, as required.

Since the above steps as well as algorithm Stratify($R$, $G_1$) all run in time that is polynomial in
$n$, the constant-factor approximation algorithm for TSP with disks runs in polynomial time. □

7 Conclusions 

In this paper, we obtained the first constant-ratio approximation for TSP with disks in the plane. 
This is the first result of this kind for a class of planar convex bodies of arbitrary size that can 
intersect in an arbitrary fashion. In light of the connection we established between TSPN and 
MWHS in geometric hypergraphs, the following question emerges: 

1. Besides regions of linear union complexity (e.g., disks and pseudo-disks^1), what other types of
regions admit a constant-factor approximation for the minimum weight hitting set problem?

Obviously, a constant-factor approximation for MWHS with a certain type of neighborhoods 
does not automatically imply a constant-factor approximation for TSPN with the same type of 
neighborhoods. We conclude with a few, perhaps the simplest still unsolved questions on TSPN 
that we could identify: 

2. Is there a constant-factor approximation algorithm for TSP with a set of objects of linear 
union complexity, e.g., pseudo-disks? 

3. Is there a constant-factor approximation algorithm for TSP with convex bodies in the plane?^2

4. Is TSP with disks in the plane APX-hard? Is TSP with convex bodies in the plane APX-hard? 

^1 A set of regions $\mathcal{R}$ consists of pseudo-disks, if every pair of regions $\omega_1, \omega_2 \in \mathcal{R}$ satisfies the pseudo-disk property:
the sets $\omega_1 \setminus \omega_2$ and $\omega_2 \setminus \omega_1$ are connected [?, p. 293]. Equivalently, the boundaries $\partial\omega_1$ and $\partial\omega_2$ have at most two
proper intersection points.

^2 Very recently, Mitchell [37] proposed a constant-factor approximation algorithm for this variant. However, no
complete proof is available at the time of this writing. In fact, we believe that TSP with planar convex bodies is
much harder to approximate than TSP with disks.





5. Is there a constant-factor approximation algorithm for TSP with balls (with arbitrary radii
and intersections) in $\mathbb{R}^d$, in fixed dimension $d \ge 3$?